A local search approximation algorithm for k-means clustering*1

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bi-Criteria Approximation Algorithm for k-Means

We consider the classical k-means clustering problem in the setting bi-criteria approximation, in which an algoithm is allowed to output βk > k clusters, and must produce a clustering with cost at most α times the to the cost of the optimal set of k clusters. We argue that this approach is natural in many settings, for which the exact number of clusters is a priori unknown, or unimportant up to...

متن کامل

Lecture 9 & 10 : Local Search Algorithm for k - median Problem ( contd . ) , k - means Problem

Let us define some notation which will help us analyze the algorithm. L := A solution (k-subset) returned by Local Search. Copt := An optimal solution for the k-median problem. We will eventually show that Cost(L) ≤ 5 · Cost(Copt). For any p ∈ P,C ⊆ P, NN(p, C) := c̄ ∈ C that minimizes d(p, ·). So d(p,NN(p, C)) = d(p, C) by definition. Also, for any C ⊆ P, c̄ ∈ C, Cluster(C, c̄) := {q ∈ P | NN(q, ...

متن کامل

Approximation Bounds for Hierarchical Clustering: Average Linkage, Bisecting K-means, and Local Search

Hierarchical clustering is a data analysis method that has been used for decades. Despite its widespread use, the method has an underdeveloped analytical foundation. Having a well understood foundation would both support the currently used methods and help guide future improvements. The goal of this paper is to give an analytic framework to better understand observations seen in practice. This ...

متن کامل

Persistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm

Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...

متن کامل

Local Search Methods for k-Means with Outliers

We study the problem of k-means clustering in the presence of outliers. The goal is to cluster a set of data points to minimize the variance of the points assigned to the same cluster, with the freedom of ignoring a small set of data points that can be labeled as outliers. Clustering with outliers has received a lot of attention in the data processing community, but practical, efficient, and pr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computational Geometry

سال: 2004

ISSN: 0925-7721

DOI: 10.1016/s0925-7721(04)00021-5